Annalen der Physik, 1 September 2009 



Network modules help the identification of key transport 
routes, signaling pathways in cellular and other networks 

Robin Palotai* ' and Peter Csermely** ' 

' Semmelweis University, Department of Medical Chemistry, Budapest, Hungary 

Received XX September 2009, accepted XX XXX 2009 

Key words networks, learning, signal transduction, transport 
PACS 89.75.Fb, 89.75.Hc 

Complex systems are successfully reduced to interacting elements via the network concept. Transport plays 
a key role in the survival of networks — for example the specialized signaling cascades of cellular networks 
filter noise and efficiently adapt the network structure to new stimuli. However, our general understanding 
of transport mechanisms and signaling pathways in complex systems is yet limited. Here we summarize the 
key network structures involved in transport, list the solutions available to overloaded systems for relaxing 
their load and outline a possible method for the computational determination of signaling pathways. We 
highlight that in addition to hubs, bridges and the network skeleton, the overlapping modular structure is 
also essential in network transport. Moreover, by locating network elements in the space of overlapping 
network modules and evaluating their distance in this 'module space', it may be possible to approximate 
signaling pathways computationally, which, in turn could serve the identification of signaling pathways of 
complex systems. Our model may be applicable in a wide range of fields including traffic control or drug 
design. 
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1 Introduction 

The network concept is successfully applied to reduce complex systems into a set of interacting elements 
connected by links to examine, understand and predict the topology, dynamics and emergent properties of 
the system [1^]. In most networks the elements are autonomous agents, which not only develop direct in- 
teractions via links, but also establish long-range indirect interactions via various transport processes. The 
necessity of a transport process is usually evoked by a need for communication (social and telecommuni- 
cation networks [5, 6]), transfer of resources (logistic networks, power grids [7, 8]) or regulation ensuring 
a fast, magnified and efficient response (signal transduction networks [9, 10]). The transport process is not 
only an emergent property of the network, but also a significant organizing force behind the structure and 
dynamics of the network. Links of the network may emerge in order to serve the transport process [11] 
and disappear later, if their existence is not required anymore or becomes even harmful [12]. 

Network signaling may be considered as a highly specialized case of transport. Signaling of cellular net- 
works is a system level response to an incoming stimulus and is an extremely selective behavior fine-tuned 
by evolution. It efficiently filters noise-like stimuli, while quickly develops complex signaling cascades in 
response to a recognized stimulus [13, 14]. 

How does a network learn to discriminate between signal and noise? Our own studies [14-17] may 
help us to describe a common scenario: when an unusual signal arrives, which is strong and persistent 
enough to modify network behavior, the network slightly or profoundly disassembles: as a major process 
network modules (groups, communities [3, 18]) become loosely attached with a decreased overlap. When 
the stimulus is over, the network reassembles again. In this phase a large number of inter-modular contacts 
become re-established. However, these inter-modular contacts will not be exactly the same as before the 
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Table 1 Main properites of transport processes 



Property 


Description 


Purpose 


Goal of the transport process, usually related to the fitness or survival 




\JL llCLWUlJv ClClllClILa allli LllC llCLWUlJv as all CllLlLV. 


Sources and sinks 


Specific network elements may be identified as the source or sink of 




a ^IVCll LlallauoiLCU UUallLlLy. 


Information need 


Routing mechanism of network elements determining which neigh- 




boring element will they forward a received quantity to may require 




either local, global or intermediate (mesoscopic) knowledge about the 




network. 


Determinism 


Routing may be deterministic or stochastic. 


Adaptiveness 


Routing may (adaptive process) or may not (static process) be af- 




fected by the dynamic properties of the network. 


Information preservation 


Quantities transported may remain unchanged, suffer distortion or 




even get lost. 


Time 


Transport may be a discrete- or continous-time process. 



stimulus: by developing a structural 'imprint' of the signal, the complex system has now a memory, it 
learned, on one hand which links may be more effective to dissipate the stimulus most efficiently, and 
on the other hand, which links are disturbing this process. If a similar stimulus arrives regularly (or the 
stimulus is large enough that all networks which were unable to learn the reorganization described above 
will disassemble and die) than the newly selected pathway may become dominant and may behave as a 
signaling pathway from then on. 

In Section 2 we enumerate the main properties differentiating transport processes and consider opti- 
mality criteria with an emphasis on network throughput, then summarize the basic structures utilized by 
the network for efficient transport and filtering, namely the network skeleton, hubs (highly connected el- 
ements), bridges (elements connecting sparsly inter-connected network segments) and network modules 
(communities). In Section 3 we propose a method for reconstructing simulated and signaling pathways 
based on overlapping network module information. In Section 4 we summarize our findings and conclude. 

2 Characterization of network transport 

As described in Table 1 summarizing the main properties differentiating transport processes, the goal of the 
transport process is usually related to the survival of the system defining the network and therefore trans- 
port is related to the survival of the network as a connected graph with a large giant component. Transport 
may mobilize resources or information between network elements which are then used for the benefit of the 
system described by the network. For example, a network element in need may propagate a request mes- 
sage and other elements may send resources in response — this signaling scenario could emerge without 
network elements having attributable intentions or desires. 'Transport-provoking' cooperation may emerge 
through evolutionary mechanisms like signaling games [19]. The recently introduced protein games might 
also play a similar role in case of amino-acid networks [17, 20]. 

As for routing, local mechanisms tend to be stochastic because they lack extended information and 
therefore cannot be certain about the effectiveness of any single deterministic choice, while informed 
global routing mechanisms usually favor more deterministic approaches. Cellular networks exhibit local 
routing property, as the transmitted signal can be represented by the propagation of conformational changes 
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Table 2 Expectations toward transport processes 



Expectation 


Description 




V7UClaLlUll yJL LllC LlallaUUlL UlUi^Caa allUUlU aLllVC IKJ tH^lllCVC allU 




maintain its goal. 


^ 1 m nl 1 c^ tv 

O llii L/ J-IL'l L y 


AmontT tr?inQnr\rt nroppQQPQ r\T Qimiliir nprformiiripp fhp r\np wifli 

ZT.iiiwII^ LitHISUwlL LJlVJi^CftftCa Wl Oiliilltli L/dlWlliitllH^C LllC VJllC WlLll 




the simplest mechanism is preferred. 


Robustness 


The transport process should resist network failures, or at least 




degrade gracefully, proportional to network load or damage. 


Stability 


Operation of the transport process should lead to an equilibrium 




state of the network under stable circumstances. 


Fairness 


The transport process should strive to satisfy the transportation 




needs of all network elements equally. 


Optimality 


Operation of the transport process should be optimal for a set of 




criteria. 



of interacting proteins, and such changes may be evaluated locally via means of induced fit, conformational 
selection or protein games [17, 21, 22]. 

Let us take a detour and examine the optimality criterion of Table 2, which lists common expectations 
originally set towards computer networks [23]. It should be noted that an universally applicable set of 
optimality criteria does not exist, mainly due to that different optimality criteria are usually in conflict 
with each other Conventional optimality criteria include low duration of delivery, short delivery paths or 
high transport throughput. In Box 1 we summarize a simple, yet descriptive model of network transport, 
which is sufficiently abstract not to distract attention with implementation details, but still lets us draw 
conclusions about the network transport processes focusing on the criteria mentioned above [1 1, 24-26]. 

For example, in cell signaling networks short delivery paths would be preferred to reduce the distortion 
or loss of information (and indeed, most cellular networks are small-worlds [2,4, 8]), while higher through- 
put would let the signaling network handle more stimuli simultaneously. Unfortunately shorter delivery 
paths increase the load on network elements of high centrality, and this, in turn, lowers the maximum 
possible throughput. As the example described here shows a transport process may conform to differ- 
ent optimization criteria to some extent but not all of them simultaneously. Moreover, if the satisfaction 
of multiple optimization criteria involves an increased complexity of the transport process, this increased 
complexity may hurt our expectations of simplicity, robustness or stability. 

Knowing that the topology of any network sets an implicit upper bound on the maximum possible 
network throughput [27], it is interesting to investigate what kind of measures could the network utilize — 
apart from rearranging or coarsening its link structure [12, 28, 29] — in order to relax overloaded elements 
and prevent congestion. 

First, network elements may exhibit adaptive behavior of taking into account the load of other elements 
in their routing mechanism. This behavior is exemplified by the multiple copies of protein isoforms in 
critical positions of cellular networks, such as the 'critical nodes' defined by Kahn and co-workers [30]. 

Second, network elements may resist to transport more than a given quantity, resulting in a filtered, 
faulty transport process but relaxing the load on the elements of the network. It is not surprising that 
network elements and structures of high centrality (having consequently a high load) are natural candidates 
for such filtering, because these elements of high centrality are expected to constitute a network skeleton 
or superhighway of transport [31] and thus are able to filter excessive amounts of transported quantities. 
Generally, congestion affects most the communication boundaries, such as central hubs of hierarchical 
networks or overlaps of network modules, both providing bridges between different network segments [32] . 
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Box 1 A simple model of network transport 

In the original model of [11] information packets denoted a*'^ are traveling from the source network 
element i to the sink network element k in the network G ~ {N,E). In each discrete timestep first 
R ~ pN new packets a*'"' are generated with i and k chosen randomly and are added to the pool of packets 
at i denoted Qi. Then for each element u a count of packets = C are randomly removed from Q„ and 
forwarded according to the routing strategy, or to the sink k if it is neighboring u (and thus the packet is 
removed from the network). Fig. 1 shows an example scenario. 

This discrete transport model is very flexible: First, both sources and sinks can be identified, however 
this is not strictly necessary for its application. Second, depending on the applied routing strategy, 
the information need may be either local, global or mesoscopic, transport may either be stochastic or 
deterministic, information may be preserved or lost with some probability at each step of routing (see 
Table 1). 

If the routing strategy is static and Markovian (the packet routing is independent of previously visited 
network elements), then both the expected path length and the expected load Bu of any element u (called 
effective betweenness) can be analytically derived. Moreover, if we define congestion as a state where exists 
an element u in the network with the packet pool Q„ growing faster than the processing capacity C„ of 
that element, then the throughput of the transport process can be characterized with the maximum R = 
value without congestion, given by Rc = min{C„iV(iV — 1)/Bu}. Note that if any Cu = C then the 
network throughput is capped by the element of maximum effective betweenness. 




a) b) c) 

Fig. 1 An example scenario for the simple model of Box 1. a) The network element A and its neighboring elements 
B, C and D are shown. The packet pool of element A is shown containing a packet denoted B-C with source element B 
and sink element C. b) Generation step: The new packet A-D is generated at element A with random sink element of D. 
c) Transport step: A random packet (now packet B-C) is selected from the packet pool of A and becomes forwarded 
according to the routing mechanism. As packet B-C reaches its sink element C, it will disappear from the network. 

If we define modules as having more intra-modular links than inter-modular [33], then modules themselves 
also act as noise traps with noise rather circulating inside the module and eventually getting dissipated 
instead leaving the module. 

Third, the routing mechanism may decide to sacrifice certain optimization criteria in favor of network 
throughput by deliberately utilizing alternative or back-up routes to some extent in parallel with the network 
skeleton. This procedure is not necessary adaptive, for example (overlapping) network module information 
may serve as a basis for static routing if known [34, 35]. 

3 Simulated pathways on overlapping modules 

Recent advances in network module identification methods are not only able to assign elements to multiple 
overlapping modules but also provide metrics describing membership strength of any elements to different 
modules [18,36-39], effectively locating network elements in the A/-dimensional module space, M being 
the number of modules in the network. Therefore it is possible to evaluate a structural compatibility 
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Fig. 2 An illustrative network with three overlapping modules (marked with ellipses) is shown. Small circles denote 
network elements, links are not visible. One observed and two simulated pathways connecting a source element (SO) 
with a sink element (SI) are shown: the actually observed pathway (continuous line), a module-based shortest path 
with distance metric calculated in module space as described in Section 3 (dashed line) and a traditional shortest 
path (dotted line). Thin dotted lines indicate the distance between elements of simulated pathways and the observed 
pathway. In the scenario of the illustrative figure, the module-based shortest path approximates the observed pathway 
better than the traditional shortest path. 

between network elements based on the distance of elements in the module space. This fact, combined with 
the observations mentioned in Section 2 that 1) propagation of transmitted signals in cellular networks can 
be evaluated via local compatibility metrics between network elements and 2) information on overlapping 
network modules may serve as a basis for routing, raises the question if, module-based simulated pathways 
between network elements are correlated to actual pathways observed in the network. 

In order to decide, if the observed and simulated pathways are correlated, one may compare observed 
pathways p, traditional shortest paths psp and module-based shortest paths pmsp- in the latter case the 
distance du.v of network elements u and v is the Manhattan distance |5„[fc] — where bi is a 

vector of AI components with bi [k] being the membership strength of network element i to the module k. 

To compare an observed pathway p with a simulated pathway p, one may calculate the distance d{p,p) 
by summing the d{up,p) distances between elements up G p and the pathway p, where d{up,p) is the 
minimum distance between element up and any element w £ p. Finally the normalized d'{p,p) is intro- 
duced as d{p, p) I \p\, where \p\ denotes the number of elements in p. Fig. 2 shows an illustrative scenario 
of pathway comparison. 

If d' {p,pmsp) would prove to be generally lower than d'{p,psp), then we could conclude that module- 
based shortest paths are better approximators of actual network pathways than shortest paths. 

Modules often correspond to various functions of the system coded by the network. Therefore, the 
modular analysis described above may also help us to determine key signaling pathways as inter-modular 
routes. This becomes especially likely, if we take into account the hierarchical structure of modules, where 
modules of the original layer are represented as elements of the next layer of hierarchy [35, 37]. 

4 Summary 

We have described that transport processes are significant organizing forces of the network structure and 
dynamics, and considered a mechanism of network signaling filtering noise and adapting to newly recog- 
nized stimuli. We investigated the main properties differentiating transport processes, listed expectations 
towards transport processes and noted that an universally applicable set of optimization criteria does not 
exist due to criterion-conflicts. We examined the limits of optimizing network transport for highest possi- 
ble throughput in the framework of a simple, yet descriptive model of network transport and described the 
ways how different network structures could cause, and, interestingly, also relax congestion. 
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We highlighted the role of overlapping network modules and proposed that exploiting the information on 
overlapping modules, for example the distance between network elements in the 'module space', may help 
the analysis of routing mechanisms. Finally, we asked the question if module-based simulated pathways 
between network elements are correlated to real pathways observed in the network, and suggested a method 
for determining the answer 

If modul-based pathways would describe well the real, observed pathways, the identification of key, 
signaling pathways of complex systems would become possible using higher layers of the hierarchical 
modules. Such knowledge could be utilized in a wide range of fields including traffic control or drug 
design. 
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